Google Scholar’s Ranking Algorithm: An Introductory Overview
نویسندگان
چکیده
Google Scholar is one of the major academic search engines but its ranking algorithm for academic articles is unknown. We performed the first steps to reverse-engineering Google Scholar’s ranking algorithm and present the results in this research-in-progress paper. The results are: Citation counts is the highest weighed factor in Google Scholar’s ranking algorithm. Therefore, highly cited articles are found significantly more often in higher positions than articles that have been cited less often. As a consequence, Google Scholar seems to be more suitable for finding standard literature than gems or articles by authors advancing a new or different view from the mainstream. However, interesting exceptions for some search queries occurred. Moreover, the occurrence of a search term in an article’s title seems to have a strong impact on the article’s ranking. The impact of search term frequencies in an article’s full text is weak. That means it makes no difference in an article’s ranking if the article contains the query terms only once or multiple times. It was further researched whether the name of an author or journal has an impact on the ranking and whether differences exist between the ranking algorithms of different search modes that Google Scholar offers. The answer in both of these cases was "yes". The results of our research may help authors to optimize their articles for Google Scholar and enable researchers to estimate the usefulness of Google Scholar with respect to their search intention and hence the need to use further academic search engines or databases. Academic Search Engines, Google Scholar, Ranking Algorithm, Research in Progress
منابع مشابه
Putting Google Scholar to the test: a preliminary study
Purpose – To describe a small-scale quantitative evaluation of the scholarly information search engine, Google Scholar. Design/methodology/approach – Google Scholar’s ability to retrieve scholarly information was compared to that of three popular search engines: Ask.com, Google and Yahoo! Test queries were presented to all four search engines and the following measures were used to compare them...
متن کاملClassic papers: déjà vu, a step further in the bibliometric exploitation of Google Scholar
After giving a brief overview of Eugene Garfield’s contributions to the issue of identifying and studying the most cited scientific articles, manifested in the creation of his Citation Classics, the main characteristics and features of Google Scholar’s new service -Classic Papers-, as well as its main strengths and weaknesses, are addressed. This product currently displays the most cited Englis...
متن کاملThe Depth and Breadth of Google Scholar: An Empirical Study
The introduction of Google Scholar in November 2004 was accompanied by fanfare, skepticism, and numerous questions about the scope and coverage of this database. Nearly one year after its inception, many of these questions remain unanswered. This study compares the contents of 47 different databases with that of Google Scholar. Included in this investigation are tests for Google Scholar publica...
متن کاملA Review Paper on Page Ranking Algorithms
Page Rank is extensively used for ranking web pages in order of relevance by mostly all search engines world-wide. There are many algorithms for page ranking such as Google Page Rank algorithm, Hyperlink-Induced Topic Search (HITS) algorithm etc. Some search engine uses link structure based page ranking algorithm while some uses content based. The page ranking algorithm reflects the popularity ...
متن کاملDamping factor in Google page ranking
Google, the largest search engine worldwide, adopts PageRank technology to determine the rank of website listings. This paper describes how damping factor is a critical factor in changing a website’s ranking in traditional Google PageRank technology. A modified algorithm based on input–output ratio concept is proposed to substitute for the damping factor. Besides there is no need to choose an o...
متن کامل